Multiuser Co-Inference With Batch Processing Capable Edge Server
Abstract
Graphics processing units (GPUs) can improve deep neural network inference throughput via batch processing, where multiple tasks are processed concurrently. We focus on novel scenarios in which energy-constrained mobile devices offload inference tasks to an edge server with a GPU. The inference task is partitioned into sub-tasks for a finer granularity of offloading and scheduling, and the user energy consumption minimization problem under inference latency constraints is investigated. To deal with the coupled offloading and scheduling introduced by concurrent batch processing, we first consider an offline problem with a constant edge inference latency and the same latency constraint for all users. It is proven that optimizing the offloading policy of each user independently and aggregating all the same sub-tasks in one batch is optimal, which inspires the independent partitioning and same sub-task aggregating (IP-SSA) algorithm. Further, the optimal grouping (OG) algorithm is proposed to optimally group tasks when the latency constraints are different. Finally, when future task arrivals cannot be precisely predicted, a deep deterministic policy gradient (DDPG) agent is trained to call OG. Experiments show that IP-SSA reduces user energy consumption by up to 94.9% in the offline setting, while DDPG-OG outperforms DDPG-IP-SSA by 8.92% in the online setting.
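The offline idea in the abstract (each user picks a partition point independently, and identical sub-tasks are aggregated into one batch) can be illustrated with a minimal Python sketch. This is a toy model under stated assumptions, not the paper's IP-SSA algorithm: the function names, the per-sub-task cost lists, and the rule that the batch for sub-task k must start by `deadline - (n - k) * edge_latency` are all hypothetical choices made for illustration.

```python
from collections import defaultdict


def choose_partition(local_time, local_energy, upload_time, upload_energy,
                     edge_latency, deadline):
    """Pick p, the number of sub-tasks run on the device (hypothetical toy model).

    local_time/local_energy have one entry per sub-task (length n).
    upload_time/upload_energy have n + 1 entries: the cost of sending the
    data needed to start sub-task p on the server (entry n is fully local).
    Returns (energy, p) with minimal user energy, or None if infeasible.
    """
    n = len(local_time)
    best = None
    t_local = 0.0
    e_local = 0.0
    for p in range(n + 1):
        if p > 0:
            t_local += local_time[p - 1]
            e_local += local_energy[p - 1]
        # Assumption: constant edge latency per sub-task, so the shared batch
        # for sub-task p must start by this time to meet the common deadline.
        start_by = deadline - (n - p) * edge_latency
        if t_local + upload_time[p] <= start_by:
            energy = e_local + upload_energy[p]
            if best is None or energy < best[0]:
                best = (energy, p)
    return best


def aggregate_batches(partitions, n_subtasks):
    """Group users by sub-task index so each sub-task runs as one batch."""
    batches = defaultdict(list)
    for user, p in partitions.items():
        for k in range(p, n_subtasks):
            batches[k].append(user)
    return dict(batches)


# Two hypothetical users, three sub-tasks, common deadline of 4 time units.
choice_a = choose_partition([1, 1, 1], [2, 2, 2], [1, 0.5, 0.5, 0],
                            [5, 1, 1, 0], edge_latency=1, deadline=4)
choice_b = choose_partition([2, 2, 2], [1, 1, 1], [0.5, 0.5, 0.5, 0],
                            [4, 3, 2, 0], edge_latency=1, deadline=4)
batches = aggregate_batches({"a": choice_a[1], "b": choice_b[1]}, 3)
```

In this toy instance each user's choice depends only on that user's own costs and the shared schedule, which is the sense in which the partitioning is independent; the server then runs one batch per sub-task index.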
Similar resources
FINITE POPULATION SINGLE SERVER BATCH SERVICE QUEUE WITH COMPULSORY SERVER VACATION
A single server finite population queueing model with compulsory server vacation and with fixed batch service has been considered. For this model the system steady state probabilities are obtained. Some performance measures are calculated and numerical examples are also given.
Two server queueing system with single and batch service
A two server queueing system with single and batch service is considered in this paper. The arrival process is assumed to be Poisson and the service rate follows an exponential distribution. Server-I serves the customers in both single and batch service, while Server-II serves the customers in batch service only. The Laplace transform of the transient and steady state behavior of the model is c...
On edge Co-PI indices
In this paper, we first mention some results related to the PI and vertex Co-PI indices and then introduce the edge versions of the Co-PI indices. We then obtain some properties of these new indices.
Designing a Multiuser HDTV Storage Server
Future advances in networking coupled with the rapid advances in storage technologies will make it feasible to build an HDTV-on-demand server (that provides services similar to those of a neighborhood videotape rental store) on a metropolitan-area network. In this paper, we present a quantitative study of designing a multi-user HDTV server, and present efficient techniques for (1) storing multip...
Queue dependent additional server queueing problem with batch arrivals
We consider in this paper the steady state behaviour of a queueing system with a queue length dependent additional server facility wherein arrivals occur in batches of variable size. Whenever the queue length in front of the first server reaches a certain length, the system adds another server. Steady state probabilities and expected queue lengths in Single Server System and Additional Server S...
Journal
Journal title: IEEE Transactions on Wireless Communications
Year: 2023
ISSN: 1536-1276, 1558-2248
DOI: https://doi.org/10.1109/twc.2022.3192613